Paradise
Aligning Language Models Using Follow-up Likelihood as Reward Signal
Zhang, Chen, Chong, Dading, Jiang, Feng, Tang, Chengguang, Gao, Anningzhe, Tang, Guohua, Li, Haizhou
In natural human-to-human conversations, participants often receive feedback signals from one another based on their follow-up reactions. These reactions can include verbal responses, facial expressions, changes in emotional state, and other non-verbal cues. Similarly, in human-machine interactions, the machine can leverage the user's follow-up utterances as feedback signals to assess whether it has appropriately addressed the user's request. Therefore, we propose using the likelihood of follow-up utterances as rewards to differentiate preferred responses from less favored ones, without relying on human or commercial LLM-based preference annotations. Our proposed reward mechanism, ``Follow-up Likelihood as Reward" (FLR), matches the performance of strong reward models trained on large-scale human or GPT-4 annotated data on 8 pairwise-preference and 4 rating-based benchmarks. Building upon the FLR mechanism, we propose to automatically mine preference data from the online generations of a base policy model. The preference data are subsequently used to boost the helpfulness of the base model through direct alignment from preference (DAP) methods, such as direct preference optimization (DPO). Lastly, we demonstrate that fine-tuning the language model that provides follow-up likelihood with natural language feedback significantly enhances FLR's performance on reward modeling benchmarks and effectiveness in aligning the base policy model's helpfulness.
Bray Wyatt makes shocking return at WWE's Extreme Rules PPV
Fox News Flash top headlines are here. Check out what's clicking on Foxnews.com. Weeks of teases and vignettes featuring a white rabbit and cryptic messages paid off Saturday night at WWE's Extreme Rules pay-per-view at the Wells Fargo Center in Philadelphia. After Riddle defeated Seth Rollings in the fight pit, WWE announcers Michael Cole and Corey Graves were about to sign off the broadcast when the screen went black and shady characters began to appear in the crowd. "He's got the whole world in his hands," blared over the speakers and characters from Bray Wyatt's Firefly Fun House showed up in the crowd.
Artificial intelligence expected to have a big impact on white collar jobs
Better educated, better paid white collar workers will be the most affected by artificial intelligence (AI), according to a newly released report by the Brookings Institution. The report goes against previous findings of Brookings' and other research that shows less educated and lower-wage workers will be most impacted by robots. Stanford University researcher Michael Webb's approach was to take the text of patents to identify the capabilities of AI, and then quantify the extent to which each occupation involves these technologies. Webb used natural language processing to quantify the overlap between patent texts and job description text and came up with an exposure score for each job. Out of the 769 occupational descriptions Webb analyzed, 740 "contain a capability pair match with AI patent language, meaning at least one or more of its tasks could potentially be exposed to, complemented by, or completed by AI,'' the report noted. "Importantly, this does not mean such tasks will be ...